Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 52
Filtrar
1.
Methods ; 170: 38-47, 2020 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-31442560

RESUMO

Chromosome Conformation Capture (3C)-based technologies, such as Hi-C, have represented a significant breakthrough in investigating the structure and function of higher-order genome architecture. However, the mapping of global chromatin interactions remains challenging across many biological conditions due to high background noise and financial constraints, especially for small laboratories. Here, we describe the Bridge linker-Alul-Tn5 Hi-C (BAT Hi-C) method, which is a simple and efficient method for delineating chromatin conformational features of mouse embryonic stem (mES) cells and uncover DNA loops. This protocol combines Alul fragmentation and biotinylated linker-mediated proximity ligation to obtain kilobase (kb) resolution with a marked increase in the amount of unique read pairs. The protocol also includes chromatin isolation to reduce background noise and Tn5 tagmentation to cut down on preparation time. Importantly, with only one-third sequencing depth, our method revealed the same spectrum of chromatin contacts as in situ Hi-C. BAT Hi-C is an economical (i.e., approximately $40 for library preparation) and straightforward (total hands-on time of 3 days) tool that is ideal for the in-depth analysis of long-range chromatin looping events in a genome-wide fashion.


Assuntos
Cromatina/genética , Mapeamento Cromossômico/métodos , Genômica/métodos , Animais , Linhagem Celular , Núcleo Celular/genética , Cromatina/isolamento & purificação , Cromatina/metabolismo , Mapeamento Cromossômico/economia , Desoxirribonucleases de Sítio Específico do Tipo II/metabolismo , Células-Tronco Embrionárias , Biblioteca Gênica , Genômica/economia , Camundongos , Transposases/metabolismo
2.
Methods ; 170: 48-60, 2020 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-31252062

RESUMO

Studies performed using Hi-C and other high-throughput whole-genome C-methods have demonstrated that 3D organization of eukaryotic genomes is functionally relevant. Unfortunately, ultra-deep sequencing of Hi-C libraries necessary to detect loop structures in large vertebrate genomes remains rather expensive. However, many studies are in fact aimed at determining the fine-scale 3D structure of comparatively small genomic regions up to several Mb in length. Such studies typically focus on the spatial structure of domains of coregulated genes, molecular mechanisms of loop formation, and interrogation of functional significance of GWAS-revealed polymorphisms. Therefore, a handful of molecular techniques based on Hi-C have been developed to address such issues. These techniques commonly rely on in-solution hybridization of Hi-C/3C-seq libraries with pools of biotinylated baits covering the region of interest, followed by deep sequencing of the enriched library. Here, we describe a new protocol of this kind, C-TALE (Chromatin TArget Ligation Enrichment). Preparation of hybridization probes from bacterial artificial chromosomes and an additional round of enrichment make C-TALE a cost-effective alternative to existing many-versus-all C-methods.


Assuntos
Mapeamento Cromossômico/métodos , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Animais , Biotinilação , Linhagem Celular , Cromatina/química , Cromatina/genética , Cromatina/isolamento & purificação , Cromatina/metabolismo , Mapeamento Cromossômico/economia , Cromossomos Artificiais Bacterianos/genética , DNA/genética , DNA/isolamento & purificação , DNA/metabolismo , Biblioteca Gênica , Genômica/economia , Sequenciamento de Nucleotídeos em Larga Escala/economia , Humanos , Conformação de Ácido Nucleico , Hibridização de Ácido Nucleico/métodos
3.
Proc Natl Acad Sci U S A ; 116(37): 18507-18516, 2019 09 10.
Artigo em Inglês | MEDLINE | ID: mdl-31451639

RESUMO

Genome-wide phenotypic screens provide an unbiased way to identify genes involved in particular biological traits, and have been widely used in lower model organisms. However, cost and time have limited the utility of such screens to address biological and disease questions in mammals. Here we report a highly efficient piggyBac (PB) transposon-based first-generation (F1) dominant screening system in mice that enables an individual investigator to conduct a genome-wide phenotypic screen within a year with fewer than 300 cages. The PB screening system uses visually trackable transposons to induce both gain- and loss-of-function mutations and generates genome-wide distributed new insertions in more than 55% of F1 progeny. Using this system, we successfully conducted a pilot F1 screen and identified 5 growth retardation mutations. One of these mutants, a Six1/4 PB/+ mutant, revealed a role in milk intake behavior. The mutant animals exhibit abnormalities in nipple recognition and milk ingestion, as well as developmental defects in cranial nerves V, IX, and X. This PB F1 screening system offers individual laboratories unprecedented opportunities to conduct affordable genome-wide phenotypic screens for deciphering the genetic basis of mammalian biology and disease pathogenesis.


Assuntos
Mapeamento Cromossômico/métodos , Elementos de DNA Transponíveis/genética , Genoma , Técnicas de Genotipagem/métodos , Mutagênese Insercional/métodos , Animais , Animais Recém-Nascidos , Mapeamento Cromossômico/economia , Modelos Animais de Doenças , Embrião de Mamíferos , Estudos de Viabilidade , Feminino , Retardo do Crescimento Fetal/genética , Fibroblastos , Técnicas de Genotipagem/economia , Humanos , Masculino , Camundongos/genética , Camundongos Transgênicos , Mutagênese Insercional/economia , Mutação , Fenótipo , Cultura Primária de Células
4.
Nat Commun ; 10(1): 2680, 2019 06 18.
Artigo em Inglês | MEDLINE | ID: mdl-31213597

RESUMO

Genetic studies of complex traits in animals have been hindered by the need to generate, maintain, and phenotype large panels of recombinant lines. We developed a new method, C. elegans eXtreme Quantitative Trait Locus (ceX-QTL) mapping, that overcomes this obstacle via bulk selection on millions of unique recombinant individuals. We use ceX-QTL to map a drug resistance locus with high resolution. We also map differences in gene expression in live worms and discovered that mutations in the co-chaperone sti-1 upregulate the transcription of HSP-90. Lastly, we use ceX-QTL to map loci that influence fitness genome-wide confirming previously reported causal variants and uncovering new fitness loci. ceX-QTL is fast, powerful and cost-effective, and will accelerate the study of complex traits in animals.


Assuntos
Caenorhabditis elegans/genética , Mapeamento Cromossômico/métodos , Aptidão Genética/genética , Locos de Características Quantitativas/genética , Característica Quantitativa Herdável , Animais , Mapeamento Cromossômico/economia , Resistência a Medicamentos/genética , Feminino , Regulação da Expressão Gênica/genética , Masculino , Fatores de Tempo
5.
Exp Cell Res ; 358(2): 433-438, 2017 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-28017728

RESUMO

Genomic information has become a ubiquitous and almost essential aspect of biological research. Over the last 10-15 years, the cost of generating sequence data from DNA or RNA samples has dramatically declined and our ability to interpret those data increased just as remarkably. Although it is still possible for biologists to conduct interesting and valuable research on species for which genomic data are not available, the impact of having access to a high quality whole genome reference assembly for a given species is nothing short of transformational. Research on a species for which we have no DNA or RNA sequence data is restricted in fundamental ways. In contrast, even access to an initial draft quality genome (see below for definitions) opens a wide range of opportunities that are simply not available without that reference genome assembly. Although a complete discussion of the impact of genome sequencing and assembly is beyond the scope of this short paper, the goal of this review is to summarize the most common and highest impact contributions that whole genome sequencing and assembly has had on comparative and evolutionary biology.


Assuntos
Sequência de Bases/genética , Mapeamento Cromossômico , Biologia Computacional , Genoma/genética , Animais , Mapeamento Cromossômico/economia , Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Análise de Sequência de DNA/métodos
6.
Artigo em Alemão | MEDLINE | ID: mdl-27999872

RESUMO

BACKGROUND: The diagnostic use of whole-genome sequencing (WGS) is a growing issue in medical care. Due to limited resources in public health service, budget-impact analyses are necessary prior to implementation. OBJECTIVE: A budget-impact analysis for WGS of all newborns and diagnostic investigation of tumor patients in different oncologic indications were evaluated. METHODS: A cost analysis of WGS based on a quality-assured process chart for WGS at the German Cancer Research Center (DKFZ), Heidelberg, constitutes the basis for this evaluation. Data from the National Association of Statutory Health Insurance Funds and the Robert-Koch-Institute, Berlin, were used for calculations of specific clinical applications. RESULTS AND DISCUSSION: WGS in newborn screening leads to costs of € 2.85 bn and to an increase of total expenditure by 1.41%. Sequencing of all tumor patients would cost approximately € 0.84 bn, which corresponds to 0.42% of total expenditures. In all scenarios, the sole consideration of procedure costs results in increasing costs. However, in cost discussions potential savings (reduction of disease-related follow-up-costs, improved cost-effectiveness of medical measures etc.) should be considered. Such considerations are the subject of economic indication-specific evaluations. WGS has the potential to generate a large number of deterministic findings for which treatment options are limited. Hence, it is necessary to limit indications, in which WGS has proven medical evidence.


Assuntos
Mapeamento Cromossômico/economia , Testes Genéticos/economia , Custos de Cuidados de Saúde/estatística & dados numéricos , Sequenciamento de Nucleotídeos em Larga Escala/economia , Triagem Neonatal/economia , Padrões de Prática Médica/economia , Mapeamento Cromossômico/estatística & dados numéricos , Efeitos Psicossociais da Doença , Testes Genéticos/estatística & dados numéricos , Alemanha/epidemiologia , Sequenciamento de Nucleotídeos em Larga Escala/estatística & dados numéricos , Humanos , Recém-Nascido , Triagem Neonatal/estatística & dados numéricos , Padrões de Prática Médica/estatística & dados numéricos
7.
Sci Rep ; 6: 23087, 2016 03 15.
Artigo em Inglês | MEDLINE | ID: mdl-26976328

RESUMO

Microsatellites have a wide range of applications from behavioral biology, evolution, to agriculture-based breeding programs. The recent progress in the next-generation sequencing technologies and the rapidly increasing number of published genomes may greatly enhance the current applications of microsatellites by turning them from anonymous to informative markers. Here we developed an approach to anchor microsatellite markers of any target species in a genome of a related model species, through which the genomic locations of the markers, along with any functional genes potentially linked to them, can be revealed. We mapped the shotgun sequence reads of a non-model rodent species Apodemus semotus against the genome of a model species, Mus musculus, and presented 24 polymorphic microsatellite markers with detailed background information for A. semotus in this study. The developed markers can be used in other rodent species, especially those that are closely related to A. semotus or M. musculus. Compared to the traditional approaches based on DNA cloning, our approach is likely to yield more loci for the same cost. This study is a timely demonstration of how a research team can efficiently generate informative (neutral or function-associated) microsatellite markers for their study species and unique biological questions.


Assuntos
Mapeamento Cromossômico/métodos , Genoma/genética , Camundongos/genética , Repetições de Microssatélites/genética , Murinae/genética , Animais , Mapeamento Cromossômico/economia , Análise Custo-Benefício , Feminino , Ligação Genética , Genômica/economia , Genômica/métodos , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala/economia , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Reprodutibilidade dos Testes , Especificidade da Espécie
8.
BMC Genomics ; 15: 1158, 2014 Dec 22.
Artigo em Inglês | MEDLINE | ID: mdl-25534138

RESUMO

BACKGROUND: Cucumber, Cucumis sativus L., is an economically important vegetable crop which is processed or consumed fresh worldwide. However, the narrow genetic base in cucumber makes it difficult for constructing high-density genetic maps. The development of massively parallel genotyping methods and next-generation sequencing (NGS) technologies provides an excellent opportunity for developing single nucleotide polymorphisms (SNPs) for linkage map construction and QTL analysis of horticultural traits. Specific-length amplified fragment sequencing (SLAF-seq) is a recent marker development technology that allows large-scale SNP discovery and genotyping at a reasonable cost. In this study, we constructed a high-density SNP map for cucumber using SLAF-seq and detected fruit-related QTLs. RESULTS: An F2 population of 148 individuals was developed from an intra-varietal cross between CC3 and NC76. Genomic DNAs extracted from two parents and 148 F2 individuals were subjected to high-throughput sequencing and SLAF library construction. A total of 10.76 Gb raw data and 75,024,043 pair-end reads were generated to develop 52,684 high-quality SLAFs, out of which 5,044 were polymorphic. 4,817 SLAFs were encoded and grouped into different segregation patterns. A high-resolution genetic map containing 1,800 SNPs was constructed for cucumber spanning 890.79 cM. The average distance between adjacent markers was 0.50 cM. 183 scaffolds were anchored to the SNP-based genetic map covering 46% (168.9 Mb) of the cucumber genome (367 Mb). Nine QTLs for fruit length and weight were detected, a QTL designated fl3.2 explained 44.60% of the phenotypic variance. Alignment of the SNP markers to draft genome scaffolds revealed two mis-assembled scaffolds that were validated by fluorescence in situ hybridization (FISH). CONCLUSIONS: We report herein the development of evenly dispersed SNPs across cucumber genome, and for the first time an SNP-based saturated linkage map. This 1,800-locus map would likely facilitate genetic mapping of complex QTL loci controlling fruit yield, and the orientation of draft genome scaffolds.


Assuntos
Mapeamento Cromossômico/métodos , Cucumis sativus/genética , Frutas/anatomia & histologia , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Análise de Sequência de DNA/métodos , Mapeamento Cromossômico/economia , Análise Custo-Benefício , Cucumis sativus/anatomia & histologia , Frutas/genética , Técnicas de Genotipagem , Sequenciamento de Nucleotídeos em Larga Escala/economia , Tamanho do Órgão , Análise de Sequência de DNA/economia
9.
BMC Genomics ; 15: 433, 2014 Jun 04.
Artigo em Inglês | MEDLINE | ID: mdl-24898122

RESUMO

BACKGROUND: Understanding genetic control of tassel and ear architecture in maize (Zea mays L. ssp. mays) is important due to their relationship with grain yield. High resolution QTL mapping is critical for understanding the underlying molecular basis of phenotypic variation. Advanced populations, such as recombinant inbred lines, have been broadly adopted for QTL mapping; however, construction of large advanced generation crop populations is time-consuming and costly. The rapidly declining cost of genotyping due to recent advances in next-generation sequencing technologies has generated new possibilities for QTL mapping using large early generation populations. RESULTS: A set of 708 F2 progeny derived from inbreds Chang7-2 and 787 were generated and genotyped by whole genome low-coverage genotyping-by-sequencing method (average 0.04×). A genetic map containing 6,533 bin-markers was constructed based on the parental SNPs and a sliding-window method, spanning a total genetic distance of 1,396 cM. The high quality and accuracy of this map was validated by the identification of two well-studied genes, r1, a qualitative trait locus for color of silk (chromosome 10) and ba1 for tassel branch number (chromosome 3). Three traits of tassel and ear architecture were evaluated in this population, a total of 10 QTL were detected using a permutation-based-significance threshold, seven of which overlapped with reported QTL. Three genes (GRMZM2G316366, GRMZM2G492156 and GRMZM5G805008) encoding MADS-box domain proteins and a BTB/POZ domain protein were located in the small intervals of qTBN5 and qTBN7 (~800 Kb and 1.6 Mb in length, respectively) and may be involved in patterning of tassel architecture. The small physical intervals of most QTL indicate high-resolution mapping is obtainable with this method. CONCLUSIONS: We constructed an ultra-high-dentisy linkage map for the large early generation population in maize. Our study provides an efficient approach for fast detection of quantitative loci responsible for complex trait variation with high accuracy, thus helping to dissect the underlying molecular basis of phenotypic variation and accelerate improvement of crop breeding in a cost-effective fashion.


Assuntos
Mapeamento Cromossômico/métodos , Inflorescência/genética , Zea mays/anatomia & histologia , Zea mays/crescimento & desenvolvimento , Mapeamento Cromossômico/economia , Cromossomos de Plantas , DNA de Plantas/genética , Fenótipo , Locos de Características Quantitativas , Característica Quantitativa Herdável , Análise de Sequência de DNA , Zea mays/genética
10.
DNA Res ; 19(6): 449-61, 2012 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-23103470

RESUMO

Single-nucleotide polymorphisms (SNPs, >2000) were discovered by using RNA-seq and allele-specific sequencing approaches in pigeonpea (Cajanus cajan). For making the SNP genotyping cost-effective, successful competitive allele-specific polymerase chain reaction (KASPar) assays were developed for 1616 SNPs and referred to as PKAMs (pigeonpea KASPar assay markers). Screening of PKAMs on 24 genotypes [23 from cultivated species and 1 wild species (Cajanus scarabaeoides)] defined a set of 1154 polymorphic markers (77.4%) with a polymorphism information content (PIC) value from 0.04 to 0.38. One thousand and ninety-four PKAMs showed polymorphisms between parental lines of the reference mapping population (C. cajan ICP 28 × C. scarabaeoides ICPW 94). By using high-quality marker genotyping data on 167 F(2) lines from the population, a comprehensive genetic map comprising 875 PKAMs with an average inter-marker distance of 1.11 cM was developed. Previously mapped 35 simple sequence repeat markers were integrated into the PKAM map and an integrated genetic map of 996.21 cM was constructed. Mapped PKAMs showed a higher degree of synteny with the genome of Glycine max followed by Medicago truncatula and Lotus japonicus and least with Vigna unguiculata. These PKAMs will be useful for genetics research and breeding applications in pigeonpea and for utilizing genome information from other legume species.


Assuntos
Cajanus/genética , Mapeamento Cromossômico/métodos , Fabaceae/genética , Genômica/métodos , Polimorfismo de Nucleotídeo Único/genética , Alelos , Sequência de Bases , Mapeamento Cromossômico/economia , Análise Custo-Benefício , Frequência do Gene , Ligação Genética , Marcadores Genéticos/genética , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites/genética , Dados de Sequência Molecular , Análise de Sequência de DNA , Especificidade da Espécie , Sintenia/genética
12.
Plant J ; 67(4): 736-45, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21554453

RESUMO

Common variants, such as those identified by genome-wide association scans, explain only a small proportion of trait variation. Growing evidence suggests that rare functional variants, which are usually missed by genome-wide association scans, play an important role in determining the phenotype. We used pooled multiplexed next-generation sequencing and a customized analysis workflow to detect mutations in five candidate genes for lignin biosynthesis in 768 pooled Populus nigra accessions. We identified a total of 36 non-synonymous single nucleotide polymorphisms, one of which causes a premature stop codon. The most common variant was estimated to be present in 672 of the 1536 tested chromosomes, while the rarest was estimated to occur only once in 1536 chromosomes. Comparison with individual Sanger sequencing in a selected sub-sample confirmed that variants are identified with high sensitivity and specificity, and that the variant frequency was estimated accurately. This proposed method for identification of rare polymorphisms allows accurate detection of variation in many individuals, and is cost-effective compared to individual sequencing.


Assuntos
Mapeamento Cromossômico/métodos , Variação Genética/genética , Genoma de Planta/genética , Estudo de Associação Genômica Ampla/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Populus/genética , Alelos , Sequência de Bases , Mapeamento Cromossômico/economia , Cromossomos de Plantas/genética , Estudo de Associação Genômica Ampla/economia , Genótipo , Lignina/biossíntese , Mutação , Fenótipo , Proteínas de Plantas/genética , Polimorfismo de Nucleotídeo Único/genética , Sensibilidade e Especificidade , Análise de Sequência de DNA
13.
J Comput Biol ; 18(3): 237-49, 2011 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-21385031

RESUMO

Abstract Admixture mapping is a gene mapping approach used for the identification of genomic regions harboring disease susceptibility genes in the case of recently admixed populations such as African Americans. We present a novel method for admixture mapping, called admixture aberration analysis (AAA) that uses a DNA pool of affected admixed individuals. We demonstrate through simulations that AAA is a powerful and economical mapping method under a range of scenarios, capturing complex human diseases such as hypertension and end-stage kidney disease. The method has a low false-positive rate and is robust to deviation from model assumptions. Finally, we apply AAA on 600 prostate cancer-affected African Americans, replicating a known risk locus. Simulation results indicate that the method can yield over 96% reduction in genotyping. Our method is implemented as a Java program called AAAmap and is freely available at http://bioinfo.cs.technion.ac.il/AAAmap.


Assuntos
Mapeamento Cromossômico/métodos , DNA/genética , Predisposição Genética para Doença , Neoplasias da Próstata/genética , Negro ou Afro-Americano/genética , Mapeamento Cromossômico/economia , Simulação por Computador , Humanos , Hipertensão/genética , Falência Renal Crônica/genética , Desequilíbrio de Ligação , Masculino , Modelos Genéticos
14.
BMC Genomics ; 11: 570, 2010 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-20950475

RESUMO

BACKGROUND: Expressed Sequence Tags (ESTs) are a source of simple sequence repeats (SSRs) that can be used to develop molecular markers for genetic studies. The availability of ESTs for Quercus robur and Quercus petraea provided a unique opportunity to develop microsatellite markers to accelerate research aimed at studying adaptation of these long-lived species to their environment. As a first step toward the construction of a SSR-based linkage map of oak for quantitative trait locus (QTL) mapping, we describe the mining and survey of EST-SSRs as well as a fast and cost-effective approach (bin mapping) to assign these markers to an approximate map position. We also compared the level of polymorphism between genomic and EST-derived SSRs and address the transferability of EST-SSRs in Castanea sativa (chestnut). RESULTS: A catalogue of 103,000 Sanger ESTs was assembled into 28,024 unigenes from which 18.6% presented one or more SSR motifs. More than 42% of these SSRs corresponded to trinucleotides. Primer pairs were designed for 748 putative unigenes. Overall 37.7% (283) were found to amplify a single polymorphic locus in a reference full-sib pedigree of Quercus robur. The usefulness of these loci for establishing a genetic map was assessed using a bin mapping approach. Bin maps were constructed for the male and female parental tree for which framework linkage maps based on AFLP markers were available. The bin set consisting of 14 highly informative offspring selected based on the number and position of crossover sites. The female and male maps comprised 44 and 37 bins, with an average bin length of 16.5 cM and 20.99 cM, respectively. A total of 256 EST-SSRs were assigned to bins and their map position was further validated by linkage mapping. EST-SSRs were found to be less polymorphic than genomic SSRs, but their transferability rate to chestnut, a phylogenetically related species to oak, was higher. CONCLUSION: We have generated a bin map for oak comprising 256 EST-SSRs. This resource constitutes a first step toward the establishment of a gene-based map for this genus that will facilitate the dissection of QTLs affecting complex traits of ecological importance.


Assuntos
Mapeamento Cromossômico/economia , Mapeamento Cromossômico/métodos , Etiquetas de Sequências Expressas , Marcadores Genéticos , Repetições Minissatélites/genética , Quercus/genética , Análise Custo-Benefício , Mineração de Dados , Genoma de Planta/genética , Repetições de Microssatélites/genética , Polimorfismo Genético
15.
Genet Epidemiol ; 34(5): 479-91, 2010 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-20552648

RESUMO

Most common hereditary diseases in humans are complex and multifactorial. Large-scale genome-wide association studies based on SNP genotyping have only identified a small fraction of the heritable variation of these diseases. One explanation may be that many rare variants (a minor allele frequency, MAF <5%), which are not included in the common genotyping platforms, may contribute substantially to the genetic variation of these diseases. Next-generation sequencing, which would allow the analysis of rare variants, is now becoming so cheap that it provides a viable alternative to SNP genotyping. In this paper, we present cost-effective protocols for using next-generation sequencing in association mapping studies based on pooled and un-pooled samples, and identify optimal designs with respect to total number of individuals, number of individuals per pool, and the sequencing coverage. We perform a small empirical study to evaluate the pooling variance in a realistic setting where pooling is combined with exon-capturing. To test for associations, we develop a likelihood ratio statistic that accounts for the high error rate of next-generation sequencing data. We also perform extensive simulations to determine the power and accuracy of this method. Overall, our findings suggest that with a fixed cost, sequencing many individuals at a more shallow depth with larger pool size achieves higher power than sequencing a small number of individuals in higher depth with smaller pool size, even in the presence of high error rates. Our results provide guidelines for researchers who are developing association mapping studies based on next-generation sequencing.


Assuntos
Mapeamento Cromossômico/métodos , Genética Populacional/métodos , Estudo de Associação Genômica Ampla/métodos , Modelos Genéticos , Projetos de Pesquisa , Alelos , Mapeamento Cromossômico/economia , Simulação por Computador , Dinamarca , Predisposição Genética para Doença , Variação Genética , Genética Populacional/economia , Estudo de Associação Genômica Ampla/economia , Genótipo , Humanos , Modelos Estatísticos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/economia , Análise de Sequência de DNA/métodos
16.
BMC Genomics ; 11: 301, 2010 May 12.
Artigo em Inglês | MEDLINE | ID: mdl-20462427

RESUMO

BACKGROUND: The high-throughput anchoring of genetic markers into contigs is required for many ongoing physical mapping projects. Multidimentional BAC pooling strategies for PCR-based screening of large insert libraries is a widely used alternative to high density filter hybridisation of bacterial colonies. To date, concerns over reliability have led most if not all groups engaged in high throughput physical mapping projects to favour BAC DNA isolation prior to amplification by conventional PCR. RESULTS: Here, we report the first combined use of Multiplex Tandem PCR (MT-PCR) and High Resolution Melt (HRM) analysis on bacterial stocks of BAC library superpools as a means of rapidly anchoring markers to BAC colonies and thereby to integrate genetic and physical maps. We exemplify the approach using a BAC library of the model plant Arabidopsis thaliana. Super pools of twenty five 384-well plates and two-dimension matrix pools of the BAC library were prepared for marker screening. The entire procedure only requires around 3 h to anchor one marker. CONCLUSIONS: A pre-amplification step during MT-PCR allows high multiplexing and increases the sensitivity and reliability of subsequent HRM discrimination. This simple gel-free protocol is more reliable, faster and far less costly than conventional PCR screening. The option to screen in parallel 3 genetic markers in one MT-PCR-HRM reaction using templates from directly pooled bacterial stocks of BAC-containing bacteria further reduces time for anchoring markers in physical maps of species with large genomes.


Assuntos
Arabidopsis/genética , Mapeamento Cromossômico/métodos , Mapeamento Cromossômico/economia , Cromossomos Artificiais Bacterianos/genética , Biblioteca Gênica
17.
PLoS Comput Biol ; 5(7): e1000432, 2009 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-19593373

RESUMO

The goal of human genome re-sequencing is obtaining an accurate assembly of an individual's genome. Recently, there has been great excitement in the development of many technologies for this (e.g. medium and short read sequencing from companies such as 454 and SOLiD, and high-density oligo-arrays from Affymetrix and NimbelGen), with even more expected to appear. The costs and sensitivities of these technologies differ considerably from each other. As an important goal of personal genomics is to reduce the cost of re-sequencing to an affordable point, it is worthwhile to consider optimally integrating technologies. Here, we build a simulation toolbox that will help us optimally combine different technologies for genome re-sequencing, especially in reconstructing large structural variants (SVs). SV reconstruction is considered the most challenging step in human genome re-sequencing. (It is sometimes even harder than de novo assembly of small genomes because of the duplications and repetitive sequences in the human genome.) To this end, we formulate canonical problems that are representative of issues in reconstruction and are of small enough scale to be computationally tractable and simulatable. Using semi-realistic simulations, we show how we can combine different technologies to optimally solve the assembly at low cost. With mapability maps, our simulations efficiently handle the inhomogeneous repeat-containing structure of the human genome and the computational complexity of practical assembly algorithms. They quantitatively show how combining different read lengths is more cost-effective than using one length, how an optimal mixed sequencing strategy for reconstructing large novel SVs usually also gives accurate detection of SNPs/indels, how paired-end reads can improve reconstruction efficiency, and how adding in arrays is more efficient than just sequencing for disentangling some complex SVs. Our strategy should facilitate the sequencing of human genomes at maximum accuracy and low cost.


Assuntos
Genômica/métodos , Modelos Genéticos , Mapeamento Cromossômico/economia , Mapeamento Cromossômico/métodos , Simulação por Computador , Bases de Dados Genéticas , Genômica/economia , Modelos Estatísticos , Análise de Sequência com Séries de Oligonucleotídeos , Análise de Sequência de DNA/economia , Análise de Sequência de DNA/métodos , Software
19.
Genetica ; 135(3): 267-81, 2009 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-18506582

RESUMO

We present a cost-effective DNA pooling strategy for fine mapping of a single Mendelian gene in controlled crosses. The theoretical argument suggests that it is potentially possible for a single-stage pooling approach to reduce the overall experimental expense considerably by balancing costs for genotyping and sample collection. Further, the genotyping burden can be reduced through multi-stage pooling. Numerical results are provided for practical guidelines. For example, the genotyping effort can be reduced to only a small fraction of that needed for individual genotyping at a small loss of estimation accuracy or at a cost of increasing sample sizes slightly when recombination rates are 0.5% or less. An optimal two-stage pooling scheme can reduce the amount of genotyping to 19.5%, 14.5% and 6.4% of individual genotyping efforts for identifying a gene within 1, 0.5, and 0.1 cM, respectively. Finally, we use a genetic data set for mapping the rice xl(t) gene to demonstrate the feasibility and efficiency of the DNA pooling strategy. Taken together, the results demonstrate that this DNA pooling strategy can greatly reduce the genotyping burden and the overall cost in fine mapping experiments.


Assuntos
Mapeamento Cromossômico/métodos , DNA/química , Mapeamento Cromossômico/economia , Pool Gênico , Genótipo , Modelos Genéticos , Oryza/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...